Picture for Kai Chen

Kai Chen

Tony

RoboSemanticBench: Diagnosing Semantic Grounding in Action Prediction for VLA Models

Add code
Jun 01, 2026
Viaarxiv icon

Geometry-Guided Modeling of Foundation Features Enables Generalizable Object Shape Deformation Learning

Add code
May 28, 2026
Viaarxiv icon

MRMMIA: Membership Inference Attacks on Memory in Chat Agents

Add code
May 27, 2026
Viaarxiv icon

What and When to Distill: Selective Hindsight Distillation for Multi-Turn Agents

Add code
May 19, 2026
Viaarxiv icon

OpenCompass: A Universal Evaluation Platform for Large Language Models

Add code
May 19, 2026
Viaarxiv icon

Beyond Mode Collapse: Distribution Matching for Diverse Reasoning

Add code
May 19, 2026
Viaarxiv icon

A Deterministic Agentic Workflow for HS Tariff Classification: Multi-Dimensional Rule Reasoning with Interpretable Decisions

Add code
May 14, 2026
Viaarxiv icon

IntentVLA: Short-Horizon Intent Modeling for Aliased Robot Manipulation

Add code
May 14, 2026
Viaarxiv icon

FrameSkip: Learning from Fewer but More Informative Frames in VLA Training

Add code
May 13, 2026
Viaarxiv icon

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

Add code
May 11, 2026
Viaarxiv icon